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Abstract. We analyze the mixing time of a natural local Markov chain 
(the Glauber dynamics) on configurations of the solid-on-solid model of 
statistical physics. This model has been proposed, among other things, 
as an idealization of the behavior of contours in the Ising model at low 
temperatures. Our main result is an upper bound on the mixing time 
of 0(n 3 s ), which is tight within a factor of 0(y/n). The proof, which 
in addition gives some insight into the actual evolution of the contours, 
requires the introduction of a number of novel analytical techniques that 
we conjecture will have other applications. 
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1. Introduction 

In the n x n solid-on-solid (SOS) model Il22l [23], a configuration is an 
assignment of an integer height r](i) e [0, nj^jto each of n positions i e [1, n], 
with fixed boundary conditions ry(0) = rj(n + 1) = 0. The probability of a 
configuration is given by the Gibbs distribution 

fJ'iv) = Zp 1 exp {-PYsTi ^ ~ 1} " • (L1) 

Here (3 is a positive parameter and Zp is a normalizing factor (the "parti- 
tion function"). Thus a configuration -q = {r/(i)} of the SOS model may be 
pictured as an interface or contour with fixed endpoints (0, 0) and (n + 1, 0) 
(see Fig. [IJa)). Notice that the Gibbs distribution favors contours that are 
"smooth" (i.e., have no large jumps in height), this bias being more pro- 
nounced for larger values of (3. Moreover, the contour can be thought of as 
the path of an n-step random walk with independent geometric increments, 
conditioned to be positive and smaller than n and to return to the origin at 
time n. Therefore, its typical maximum height will be of order ^Jn. 

In this paper we analyze the (discrete time) Glauber dynamics for the 
SOS model. This is a natural local Markov chain on configurations whose 
transitions update the height at a randomly chosen position i from rj{i) to 
r]{i) ± 1; the transition probabilities are chosen so that the dynamics is re- 
versible w.r.t. the Gibbs distribution fi and thus converges to it from any ini- 
tial configuration. Our goal is to determine the mixing time, i.e., the number 
of steps until the law of dynamics is close to its equilibrium distribution /i in 
variation distance. 

Although Markovian dynamics for the SOS and related models have been 
studied extensively in many contexts connected with the behavior of random 
surfaces (see, e.g., IIT0llTTl[T2"ll2TTl '). to the best of our knowledge the mixing 



Throughout the paper, [a, 6] will denote the integer points in the interval [a, b]. 




Figure 1 . (a) A contour in the SOS model, (b) A contour in 
the Ising model. 
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time has not been rigorously analyzed. There are at least three motivations 
for studying this question, which we now describe. 

The first motivation comes from the tight connection with the more fa- 
miliar (two dimensional) Ising model, whose Glauber dynamics has been 
the focus of much attention in both statistical physics and computer science 
(see, e.g., I© El [171 [181 [161 HI H7D D - 

In the Ising model in an n x n box A n C I?, the configurations are assign- 
ments a of spin values { + ,-} to the vertices of A n . The Gibbs distribution 
is fi(a) = Zp 1 exp(—(3D(a)), where D(a) is the number of neighbors in A n 
whose spins differ and (3 is inverse temperature. The (heat-bath) Glauber 
dynamics runs as follows: at each time step a random vertex i e A n is chosen 
and its current spin value is replaced by a new value sampled from the equi- 
librium distribution at i given the neighboring spins. A variety of techniques 
have been introduced in order to analyze, at increasing levels of sophistica- 
tion, the typical time scales of the relaxation process to the reversible Gibbs 
measure (see, e.g., lfT6l |61 [181 [T4ll ). These techniques have proved to be 
quite successful in the so-called "one-phase" region, corresponding to the 
case when the system has a unique Gibbs state. When instead the thermo- 
dynamic parameters of the system correspond to a point in the "phase coex- 
istence" region, a whole class of new dynamical phenomena appear (such 
as coarsening, phase nucleation, and motion of interfaces between different 
phases) whose mathematical analysis at a microscopic level is still far from 
complete. 

One of the most important and fundamental open problems is that of 
proving a polynomial (in ri) upper bound on the mixing time at low tem- 
peratures (large j3), when the boundary conditions around the edges of A n 
are fixed to be + and hence force the system into the + phase^J (We remark 
that even the proof of a lower bound, usually a much simpler task, requires 
all the heavy technology of the Wullf construction and the associated large 
deviation theory [4].) The above question is easily reduced to the following 
problem: if the box is initially filled with - spins, how long does it take until 
this large region of - is destroyed under the influence of the + boundary 
conditions and replaced by an equilibrium configuration? This in turn is 
equivalent to the question of how the outer contour of the - region con- 
tracts towards the center of the box. For large j3, it is conjectured ITT3TI that 
the contour evolves according to a mean curvature motion and therefore 
should disappear in polynomial time 0(n 4 ) (independent of flj^ however, 
until very recently only very weak upper bounds of the form exp(0(n 1 / 2+<E )) 



It is worth mentioning that when the underlying graph Z 2 is replaced by a regular tree or 
hyperbolic graph, then optimal 0(n log n) bounds on the mixing time [ 18] or on the spectral 
gap (3] have been established. 

3 In continuous time the corresponding scaling should be 0{n 2 ), apart from possible log- 
arithmic corrections. 
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were known 111611 (except in the qualitatively different zero temperature 
case, which is analyzed in flT]]). 

The SOS model has been proposed II23H as an idealized model of this Ising 
contour, in which we think of the sites above and below the SOS contour as 
being + and - respectively. (Note that the sum ]T\ \rj(i — 1) — rj(i)\ in the 



Gibbs distribution (1.1) is, up to an additive constant, exactly D(a) under 
this interpretation.) The mixing time is essentially the number of steps until 
the maximal contour (i.e., with r/(i) = n for 1 < i < ri) drops down close 
to the bottom of the box under the influence of the boundary conditions of 
height 0. The main simplification here is that, unlike the Ising model, the 
SOS contour has no "overhangs" (see Fig. [T](b)). However, for large /3 one 
may hope that overhangs are rare, so the approximation should give useful 
insight into the behavior of the true Ising contour (see |[9j for much more 
on this point) . One of our principal motivations in this paper is to introduce 
techniques that may find application to the Ising model. Indeed, this has 
already occurred, as the very recent paper ||T9ll builds on some of the ideas 
and techniques of the present paper to obtain an upper bound of exp(0(n e )) 
(for arbitrary e > 0) on the mixing time of the Ising model at low tempera- 
tures with + boundaries, a substantial improvement on the exp(0(n 1 / 2+e )) 
bound mentioned earlier though still quite far from polynomial. 

The second motivation comes from general polymer models l!T2ll . and 
their natural Glauber-type evolutions (e.g., ll28l[T5l l5Tl). In these models the 
"polymer" is just the n-step path of some type of random walk, starting and 
ending at zero and constrained to stay non-negative. The associated Gibbs 
distribution is simply that induced by the probability distribution of the ran- 
dom walk. An additional interaction, or pinning, between the polymer and 
the line or "wall" at height zero can also be included, and the nature of this 
interaction (attractive, repulsive or even random) plays a crucial role. The 
Glauber dynamics can be defined in analogous fashion to the one studied 
in this paper. When the increments of the random walk are i.i.d. ±1 ran- 
dom variables (with or without a bias), the mixing time of the associated 
Glauber chain has been analyzed quite precisely in various cases using the 
so called "Wilson method" fl5] . When instead the increments are no longer 
uniformly bounded, as is the case for the SOS interface in this paper, a rigor- 
ous analysis of the associated Glauber dynamics apparently becomes much 
more challenging. 

This brings us to our third motivation, which stems from the challenge 
that the SOS model poses to standard techniques. The two most natural 
approaches to estimating the mixing time seem to be the following: 

1 . Coupling. One might hope that, under the natural monotone coupling of 
the SOS model (see Section [2]), the expected Hamming distance between 
two coupled copies of the dynamics is non-increasing. This would lead to a 
mixing time bound of 0(n 5 )J^] which as we shall see is rather weak and also 

throughout the paper the notation O(-) hides factors of poly log (n). 
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gives little insight into the evolution of the contour. In fact even this is not 
always true (the distance may increase in expectation in some cases), and a 
direct approach based on monotone coupling remains elusive. 

2. Comparison. Another standard approach is to first analyze a "non-local" 
dynamics in which transitions are allowed to update the height rj(i) to any 
value in [0,n]. Typically, non-local dynamics are easier to analyze precisely 
(see, e.g., IIT51I2810 . One can then use the machinery of Diaconis and Saloff- 
Coste [8] to relate the mixing time of the local dynamics to that of the 
non-local one, as was done, for example, by Randall and Tetali 11251 for the 
related "lozenge tilings" model. However, since such comparisons proceed 
via the spectral gap, they are usually quite wasteful; in particular, for the 
SOS model this approach leads to a mixing time of 0(n 8 ). 

In this paper we aim for a more refined analysis that gives, in addition to an 
almost tight bound, greater insight into the actual evolution of the contour 
in the SOS model. Our main result is the following: 

Theorem 1.1. For the n x n SOS model at any inverse temperature (3 > 0, 
the mixing time is 0(n 3 - 5 ). 



The bound on mixing time is tight up to a factor of ^/n (and logarithmic 
factors), as a lower bound of £l(n 3 ) follows from straightforward arguments 
(see Theorem |3 . 6| below) . 

The high level strategy of our analysis is as follows: 

(a) We first prove (see section 4.2 ) that in 0(n 3 5 ) steps the maximal config- 



uration (i.e., the one in which the contour has height n everywhere) reaches 
equilibrium. This analysis in turn is split into 0(a/^) repetitions of a basic 
key result which says that, starting in equilibrium but conditioned to be 
above height 0(\/n), in time 0(n 3 ) the system reaches equilibrium (see 



Theorem |4.7| ). This result allows us to bring the original contour at height n 
down to equilibrium in a sequence of 0(^/n) stages, each of which runs in 
0(n 3 ) steps and decreases the height by -Jn. 



(b) We then (see section |4.3| ) analyze the time to reach equilibrium when the 
initial configuration is the minimal one (where the height is everywhere), 
and show that 0(n 3 ) steps suffice. 

The results of (a) and (b) immediately imply, by standard results on mono- 
tone coupling, that the mixing time is <D(n 3 - 5 ). 

Our analysis in the key intermediate result of part (a), and also in part (b), 
rests on the following four essential ingredients: 

(i) First, we give a tight analysis of the non-local dynamics ment ioned above, 
showing that its mixing time is 0(n 3 log n) (see Theorem 3.1). This analy- 



sis, which we believe to be of independent interest, follows an idea of Wil- 
son, developed in the context of the lozenge tilings model [28], in using 
an eigenvector of the discrete Laplacian to obtain a contraction in distance. 
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However, to get this approach to work in our setting we need to bound a 
certain "entropy repulsion" effect due to the height barriers at and n (see 



Lemma 3.2). 



(ii) We then relate the local to the non-local dynamics using a recent "cen- 
soring inequality" of Peres and Winkler Il20ll . which says that censoring (i.e., 
not applying) some subset of updates in a monotone dynamics can only in- 
crease the distance from stationarity. This allows one to simulate a single 
move of the non-local dynamics, at position i, by censoring all local moves 
except those that update r)(i); by the censoring inequality, this can only in- 
crease the mixing time. As a result, the mixing time of the local dynamics 
is bounded above by that of the non-local dynamics times a factor related 
to the mixing time of the one-dimensional local process within the ith "col- 
umn". Essentially, censoring allows us to "schedule" the updates and thus 
maintain detailed control of the shape of the contour. 

(iii) A naive application of the censoring inequality would entail a substan- 
tial overhead of 0(n 2 ) due to the mixing time within a column, which is 
essentially the square of the maximum height difference between the two 
neighboring columns. To overcome this, we need to control the height dif- 
ferences, or "gradients" along the contour. For this purpose, we work with a 
sequence of "bounding dynamics" with gradually decreasing boundary con- 
ditions (these correspond to the 0{y/n) repetitions of the basic result men- 
tioned earlier); since the boundary conditions are — intuitively at least — the 
source of large gradients, this gives us control of the gradients. As a result, 
we are able to cut the simulation overhead between the local and non-local 
dynamics to 0(yfn). We note that this sequence of bounding dynamics cap- 
tures some of the intuition about the actual evolution of the contour. 

(iv) Making rigorous the above bound on gradients requires detailed infor- 
mation about the non-equilibrium shape of the contour, which is notoriously 
difficult to obtain. We get around this difficulty by starting the bounding 
dynamics in equilibrium, but conditioned on a certain rare event A. (The 
conditioning is necessary to ensure that the bounding property holds.) By 
choosing A such that its probability, though tiny, is nonetheless much larger 
than the probability of large gradients in equilibrium, we are able to argue 
that large gradients do not occur during the evolution. This technique is 
isolated in Lemma R~2| 



2. Preliminaries 

Gibbs distribution. We denote by Q n = [0, n] n the set of all configurations 
V = {^WliLi of the n x n solid-on-solid model, as defined in the Introduc- 
tion. The probability of a configuration r\ is given by the Gibbs distribution 
defined in equation ( l.ip . This distribution induces a conditional distribu- 
tion on the height rj(i) at position i, given the heights i](i±l) at its neighbors, 
as follows. Let a = min{r/(i — 1), rj(i + l)}, b = max{r?(i — 1), rj(i + 1)}. Then 
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Hab(j) := Pr[rj(i) = j \ a, b] is given by 



' e -P(b-a)-2P{a-j) ft if <i<a; 

Mi) = I e-^ b ~ a )/Z ifa<i<6; 

^ e -p(b-a)-2M-b) j Z if 6 < j < n , 



(2.2) 



where Z = Zp is a normalizing factor. Note that /x a & is uniform on the 
interval [a, b] and decays exponentially (at a rate depending on /?) outside 
it. 

Single-site dynamics. Our goal is to analyze the single-site Glauber dynam- 



ics] 5 which is a reversible Markov chain M.T on Q n with transitions defined 



as follows, where rj = rjt denotes the current configuration at time t: 

(1) Pick % e [1, n] u.a.r. 

(2) Let rj~, rj + be the configurations obtained from r/ by replacing r](i) 
by max{r/(z) — 1,0} and mm{i](i) + 1, n} respectively. Set r/ t+ i equal 
to rf~ or r/ + with probabilities p~, p + respectively, determined as 
follows (where a, b are the minimum and maximum heights of the 
neighbors, as above): if rj(i) < a then p~ = \e~ 2/3 , else p~ = |; 
if f]{i) > b then p + = |e~ 2/3 , else p + = i. With the remaining 
probability 1 — (p~ +p + ), set r/ t+ i = rj. 

It is standard that is an ergodic, reversible Markov chain that converges 
to the stationary distribution /i on Q n . Our goal is to estimate its mixing 
time, i.e., the number of steps required for the distribution to get close (in 
variation distance) to \i from an arbitrary initial configuration. 

Column dynamics. We will analyze by first analyzing a related Glauber 
dynamics M. c ° l that makes non-local moves. (The term "column" refers to 
the set [0, n] of possible heights at If the configuration at time t is rjt = rj, 
M^ x makes a transition as follows: 

(1) Pick i G [l,n] u.a.r. 

(2) For each j G [0, n], let denote the configuration obtained from r/ 
by replacing r](i) by j. Set j]t+\ = ff with probability proportional 



Mn is again ergodic and reversible with stationary distribution fi. Note 
that both A4n and M™ 1 update the height at a randomly chosen position i 
in a manner that is reversible w.r.t. the conditional distribution Q2.2D . The 
difference is that considers only local moves (changing the height by 
±1), while M.™ 1 allows the height at i to be set to any value. Accordingly, 
we call -A/fJf 1 the "column dynamics" and the "single-site dynamics." 

Mixing time. Let M. by any reversible Markov chain on VL n with stationary 
distribution [i. Following standard practice, we measure the convergence 



This dynamics is sometimes called the "heat-bath" dynamics; this distinction is unimpor- 
tant for our purposes. 



to \i{rf). 
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rate of M via the quantity 

t m( £ ) = min{t : — p,\\ < e V£ G $7„}, 

where i/| denotes the distribution of the configuration at time t starting 
from configuration £ at time 0, and || • || denotes variation distance. Thus 
tm (e) is the number of steps until the variation distance from /i drops to e, 
for an arbitrary initial configuration. For definiteness we define the mixing 
time as t™ x = Tj^{\l2e); it is well known (see, e.g., flU), that r^(e) < 
fine" 1 ] x t$ x for all e > 0. 

Monotonicity and coupling. We define a natural partial order on f2 n as 
follows: for configurations ij,^ £ VL n , we say that r\ < £ iff r\{%) < 
for all i G Note that ^ has unique maximal and minimal elements 

r/ max and 7? mm given by r/ ma - x (i) = n and r] mm (i) = for 1 < i < n. We can 
naturally extend this ordering to probability distributions as follows: for two 
distributions u, \i on Q n , we write v -< \i if for any increasinj^Jfunction / the 
average of / w.r.t. v is less than or equal to its average w.r.t. \x. 

A key fact we shall exploit throughout is the existence of a complete cou- 
pling of the Glauber dynamics (single-site or column) that is monotone 
w.r.t. <. A complete coupling of a Markov chain M. on Q n is a random 
function / : Q n — > ti n that preserves the transition probabilities of M, i.e., 
P^fifiv) = if] = ^Aiiv - > if) f° r a U Vi v' e to n . Note that / simultaneously 
couples the evolution of the Markov chain at all configurations. For the col- 
umn dynamics, we define / as follows. Suppose the current configuration 
is rj: 

(1) Pick % G [1, n] and a real number r G [0, 1] independently and u.a.r. 

(2) Let g(k) = Y^j=o ^ab{j) be the cumulative distribution function of 
the height at position i, given neighboring heights a, b. Set rj'(i) = 
min{£; : g(k) < r}. 

An analogous definition holds for the single-site dynamics. It is simple to 
check that these couplings are monotone w.r.t. the partial order ■<, in the 
sense that if rj t < £,t> and rjt+x,^t+i are the corresponding configurations at 
the next time step under the coupling, then rjt+i d: Ct+i- 

A further standard fact we will need is that the mixing time of the Glauber 
dynamics is bounded above by the time until the coupled evolutions started 
in the two extremal configurations, ?^ max and r/ mm , coincide with constant 
probability. More precisely: 

Proposition 2.1. ||24ll Let (rjf 1 ^), (r/f 1111 ) denote the coupled evolutions of two 
copies of a monotone Glauber dynamics M on Q n started in configurations 
rj max , rj min respectively. Then r M {e) < min{t : Prfrf 13 * / i]f m ] < e}. 

Censoring. In our analysis of the single-site dynamics, we shall also need a 
useful tool from recent work of Peres and Winkler, which says that censoring 

real-valued function / on Sl n is increasing w.r.t. < if t] < £ implies f(rf) < /(£). 
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(i.e., not applying) any subset of updates in the dynamics can only increase 
the distance from stationarity. This so-called "censoring inequality" applies 
to any monotone single-site dynamics. 

Lemma 2.2. Il20ll Suppose a monotone single-site dynamics is started in a 
random initial configuration with distribution uq such that vq/h is increasing 
w.r.t. X. Let v denote the distribution after updates at positions i\,i2, ... ,i m , 
and v' the distribution after updates at a subsequence of these positions ij 1 , ij 2 , 
. . . , ij , (chosen a priori). Then vj\i is increasing and \\v — < \\v' — 

Remark 2.3. 11201 Thm 16.5] states this result for the special case in which 
is concentrated on the maximal state n max . However, it is easy to see that 
the proof requires only the weaker assumption that v^/ptis increasing. More- 
over, by symmetry the lemma clearly also holds with "increasing" replaced by 
"decreasing. " 



The censoring inequality can be used to relate the single-site and column 
dynamics via the following observation. If we censor all moves of the single- 
site dynamics except for those that update a certain position i, then after 
some fixed number of steps T (which depends on the mixing time of the 
single-site dynamics just within the ith column, with its neighbors fixed) we 
will, up to small error, have simulated one move of the column dynamics. By 
Lemma 2^2 the censoring can only slow down convergence of the single-site 
dynamics, so the mixing time of is bounded above by roughly T times 
that of M c n ° l . We shall use a more sophisticated version of this argument in 
Section |4l 



3. The Column Dynamics 

Our goal in this section is to provide a tight analysis of the column dy- 
namics M.™ 1 . Specifically, we will prove: 

Theorem 3.1. For any (3 > 0, the mixing time of the column dynamics M c ° l 
is 0(n 3 logn). 

We believe this result, which we show is tight up to the log n factor (see 



Theorem |3.6| below), is interesting in its own right. It will also be a key 
ingredient in our analysis of the single-site dynamics later. 

Recall that, if the current configuration of M c ° x is r/ t and we choose posi- 
tion i G [1, re] at the next step, then the new height rjt+i (i) is drawn from the 
conditional distribution ( 2.2| ), where a,b are the minimum and maximum 



heights respectively of the neighbors rjt(i ± 1). A key observation is that, 
under such a move, the expected value of the new height rjt+i(i) is close to 
the average of its two neighbors; moreover, the error term satisfies a 
natural ordering property w.r.t. a, b. 
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Lemma 3.2. In the above situation, and assuming a + b < n, the expected 
value of the new height rjt+i(i) satisfies 

Efot+i (i) \a,b} = ^ + s(a, b) , (3.3) 

where e(a,b) > 0. Moreover, e(a,b) < e(c,d) for any pair c,d with c < 
min{a, d} < max{a, d} < b. 

We defer the proof of the lemma, which is somewhat technical, to the 
appendix. However, the intuition is as follows. Note that the distribution of 
r]t+i(i) is uniform on the interval [a, b], and decays symmetrically on either 
side except for the effects of the barriers at heights and n. Thus we would 
expect its mean to be close to The term e(a,b) captures the "entropy 
repulsion" effect of the barriers. This effect is more pronounced for pairs 
(a, b) that are closer to 0, as is the case for the pair (c, d) in the lemma. 



We can derive from Lemma 3.2 the following more symmetrical form that 
allows us to compare the heights of two ordered configurations under the 
monotone coupling (again, see the appendix for a proof). 

Corollary 3.3. Suppose rjt and £t are two configurations satisfying i] t H £ t , 
and let a = min{£ t (i - + 1)}. b = max{&(i - + 1)}, c = 

mm{r] t (i - l),rj t (i + 1)}, d = max{r] t (i - l),r) t (i + 1)}. Then 

< Efe+iW | a,b] -E[ Vt+l (i) \c,d]< a ± b - C ± d . (3.4) 



Armed with Corollary |3.3| we can now proceed to our analysis of 



col 



Proof of Theorem |3. 1\ Following Proposition |2.1[ it suffices to show that two 
coupled copies of M c £ l , started in configurations r^ max and r/ min , will coin- 
cide with constant probability after 0(n 3 logn) steps. Call these two copies 
(i) t max ), (r/f 1111 ) respectively. 

We will measure the distance between r/f ax and ryf 1111 using the quantity 

n 

D(t) = J2w(i)( V r x (i) - Vf m (i)), (3-5) 

2=1 

where w(i) > is a suitably chosen weight function. Note that rf^^ii) > 
rjt^ii) for all i, t by monotonicity so all terms in the sum are non-negative; 
and D(t) = iff iff*** = r?f m . Following an idea of Wilson [*28jl> we choose w 
as the second eigenvector of the discrete Laplacian operator A on [1, n] with 
zero boundary conditions, defined by Ag(i) = — h(g(i + 1) +g(i — 1)) + g(i), 
g(0) = g(n + 1) = 0. It is well known (and easy to verify) that w(i) = 
cos(— | + ^j-) with corresponding eigenvalue A = 1 — cos(^j-) = @(^). 
The reason for this choice is that, by Corollary 3.3| one step of the dy- 



namics behaves very like the Laplacian, so choosing w as an eigenvector 
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of A should give us a contraction of (1 — ^) in D at every step. The argu- 
ment proceeds as follows: 



E^i + lJ-DWIrT.r] 

1 - 

-j2Hi){nvT + r (o i % max (* - i),% max (i + 1)] 



n . 
«=i 



A^K^CO -^(0) = ~-D(t), (3-6) 
n n 

i 



where in the inequality we have used Corollary |3.3[ 

Thus after t steps of the dynamics we have E[D(t)] < (1 - ~)*D(0) < 
(1 — ^)*ra 2 for a constant c > 0. Taking t = t* = c'n 3 log( |) for a sufficiently 
large constant c' ensures that E[jD(t*)] <C Finally, we may bound the 
coupling probability at time i* as follows: 

« * — * 

i 

= (minu;(i))- 1 E[ J D(f )] < e, 

i 

where in the second line we used Markov's inequality and in the last line 
the fact that minj w(i) = cos(— \ H — ^-A = @{\). Thus, by Proposition 



2.1 



t M coi{e) <t* = 0(n 3 log(n/e)). □ 

For our analysis of the single-site dynamics, it will be convenient to in- 
troduce a "parallel" version M%? x of the column dynamics in which all odd- 
numbered (or all even-numbered) positions are updated simultaneously at 
each step. Moreover, since repeated updates of odd or even positions have 
no effect, we may as well assume that odd and even updates alternate. This 
leads to the following definition of .A/f « ai ', in which O, E denote updates of 
all odd and even positions respectively, and the update at any given position 
is performed as in the column dynamics: 

(1) Flip a single fair coin. 

(2) If heads, perform t pairs of odd-even updates (i.e., (OE) 1 ), else if 
tails perform t pairs of even-odd updates (i.e., (EO) f ). 

Note that Mn ar is a convex combination of two reversible Markov chains, 
one performing the update sequence [OE) 1 and the other (EO) 1 . We will 
call these chains M® E and M EO respectively. 
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Following our analysis of M™ 1 , it is straightforward to see that M.??* 
inherits a similar bound on the mixing time, with a factor n speedup coming 
from the parallelization of the updates. 

Theorem 3.4. The mixing time of M?? T is 0(n 2 logn). 



Proof of Theorem 3.4 We use the same distance measure (3.5) as in the 



proof of Theorem 3.1 From equation (3.6) of that proof, we conclude 
that under the t-step evolution of the column dynamics this distance sat- 
isfies E[D(t)} < i J^ii 1 ~ A)'t<;(i)(7 ? S iax (i) - r]^ in (i)), where I is the identity 
operator Ig = g. An analogous calculation for M^T leads to 

E[D(t)] < i^((/-A) 2 ^ + (/-A) 2 *-M«W ax «-% min «) 

i 

< i((l-A) 2 ' + (l-A) 2 *- 1 )Z)(0) 

< (1 - A) 2 * _1 L)(0). 

Using the facts that A = and D(0) < n 2 , and arguing as in the previ- 

ous proof, gives T^p»(e) = 0(n 2 log(n/e)) 3 as claimed. □ 



Remark 3.5. The proofs of Theorems 3.1 and 3.4\ show the stronger results 



that r M coi(e) = 0(n s log(n/e)) and t m p^(e) = 0(n 2 log(n/e)). We shall use 
this result for t m p^(e) in the next section. 

We close this section with a lower bound which shows that the above 
bound on the mixing time of the column dynamics is tight up to the log n 
factor. This lower bound also applies to the single-site dynamics, which will 
imply that our upper bound on its mixing time derived in the next section is 
tight within a factor of 0(y/n), as claimed in the Introduction. 

Theorem 3.6. The mixing times of both M™ 1 and are at least fi(n 3 ). 

Proof. Recall that the spectral gap of a reversible dynamics M is given by 

1. e E v , v ^(v)Pr M [ v ^ v f ](f( v )-f( v ')f 

g&VM ~ 2 f V^U) ' (37) 

where the infimum is over all non-constant functions / : $7 n — > R. As is 
well known (see, e.g., |fl]|), the mixing time is bounded below by gap^j, so 
it suffices to show that gap^ < n~ 3 . Now take the test function f(rj) 
^^(^(^(i + 1) — rj(i — 1)), where w is as in the proof of Theorem 



3.1 



Then straightforward calculations (basically those leading to equ ation Q3.6| ) 
above) show that, for both M.™ 1 and M.^, the numerator of (3.7) is at most 



ci/n 2 and the denominator is at least C2n, for constants ci,C2 > 0. This 
completes the proof. □ 

4. The Single-Site Dynamics 



In this section we prove our main result, Theorem 1.1 of the Introduction, 
which we restate here for convenience. 
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Theorem 4.1. The mixing time of the single-site dynamics M.^ at any inverse 
temperature j3 > Ois 0(n 3 5 ). 

As indicated in the Introduction, we analyze separately the time required 
for maximal and minimal contours to reach equilibrium under a monotone 



complete coupling; by Proposition 2.1 this suffices to bound the mixing time 



We handle the more challenging case of the maximal contour in Section 4.2 



and the minimal contour in Section 4.3 We begin with a basic analytical 



tool that we will use in both parts, which allows us to relate the single-site 
dynamics to the column dynamics analyzed previously. 

4.1. Basic building block. As explained in the Introduction, our main tool 
for analyzing the evolution of the single-site dynamics is to relate it to the 
column dynamics, for which we obtained a tight mixing time analysis in Sec- 
tion [3] To do this we will use the censoring idea explained in Section [2j As 
indicated in the Introduction, the overhead in the mixing time introduced by 
censoring depends crucially on the maximum gradient (or height difference) 
that arises in the dynamics. In this subsection, we show that this overhead 
can be kept very low (polylogarithmic in ri) provided we start the dynamics 
in the equilibrium distribution conditioned on a monotone event A whose 
probability is not extremely small (at least exp(— polylog(n))). In our sub- 
sequent analysis, we will use this basic building block repeatedly by condi- 
tioning on various suitable events A. 

Lemma 4.2. Let A be any increasing or decreasing event, and consider the 
single-site dynamics started from vq := \ A). Denote by vt its distribution 
after t steps. Let D := \log(j^)], and t n> D ■= 2n 3 D 2 \og 8 n. Then for any 
t > t Uj D md any fixed b > Owe have 

\\u t - n\\ = o(l/n b ). 

Remark 4.3. Here and elsewhere in this section, in the interests of clarity of 
exposition we make no attempt to minimize the number of log factors in our 
bounds. In particular, we frequently use a log factor in place of a sufficiently 
large constant. Abo, we generally ignore issues of rounding throughout. 

Proof. We consider only the case of an increasing event A; the decreasing 
case is entirely symmetrical. To bound the mixing time of the single-site 
dynamics, we relate it to the corresponding parallel column dynamics us- 



ing the censoring inequality (Lemma 2.2). Note that this is valid because 



the initial distribution vq satisfies the requirement that i> Q /n = xa/h(A) is 
increasing w.r.t. 

To do this, we split the time t n ,D into M := n 2 log 2 n epochs each of length 
m := 2nD 2 log 6 n. Given i n> £> random positions i = 12, . . . , it n D ) in [1, n], 
the distribution v tn D can be written as the average over i of the distribution 
v- x obtained by applying, in the given order, i n £> single-site updates at po- 
sitions i\, 12, . . . , %t n D . Next we write w(i) = (wi, ...,wjf) by grouping to- 
gether positions in the same epoch. Finally, we define two censored versions 
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of the dynamics as follows. In the first version, we delete all even positions 
from the odd epochs and all odd positions from the even epochs; denote 
the resulting censored vector OE(i) = (OEi, . . . , 0~Em) and the associated 
distribution ^oE(i)- I n the second version, we reverse the roles of odd and 
even and denote the resulting censored vector EO(i) = (EOi, . . . , EOm) 
and the associated distribution fEo(i)- 
This construction gives us 

\Wt nD - At 1 1 = || Avi j/i - fi\\ < Avi \\ui - fi\\ < Av ; |||(z/ E(i) + ^EO(i)) - Mil, 

(4.8) 

where the last step relies on the censoring inequality. Note that the expected 
number of times any position i appears in i is m/n = 2D 2 log 6 n. Hence a 
standard Chernoff bound guarantees that, apart from an error that is expo- 
nentially small in log 6 n (and hence certainly o(l/n b )), the r.h.s. of Q4.8[ ) is 
bounded above by 

max||i(i/ E(i) + ^EO(i)) - Mil, (4-9) 

where X consists of all i such that the censored vectors OE(i) and EO(i) 
contain at least D 2 log 6 n updates of every position i £ [1, n] in every epoch 
ke{l,...,M}. 

Now we claim that, for i e X, the distribution ^(^oE(i) + ^EO(i)) is ver y 
close to the distribution at time M = n 2 log 2 n of the parallel column dy- 
namics Ain ar , with the same initial distribution. To establish this, we need 
to show that D 2 log 6 n single-site updates at position i, with its neighboring 
heights fixed, are enough to simulate (with small error) one column up- 
date at i. This relies crucially on the fact that Mn ai is unlikely to produce 
configurations with large gradients, which we define to be at least D log 2 n. 
Accordingly, define the set of "bad" configurations 

B = {ij : \r](i + 1) — r](i)\ > Dlog 2 n for some i G [0, n]}. 

Claim 4.4. For i e S, we have 

II^OE(i) + ^EO(i)) " i&l ^ M(maxK OE (i?) + uf°(B)} + e -^ 2 ")), 

where v® E and vf° denote the distributions of A4° E and A4 EO respectively 
after s steps, starting from vq. 

The intuition for this Claim, which is proved formally in the appendix, is 
the following. The first term on the r.h.s. bounds the probability of seeing 
a bad configuration in Mn ar , so we may assume that r/ ^ B. A sequence of 
single-site updates at position i (with its neighboring heights a, b fixed) can 
be viewed as a nearest-neighbor random walk on column i with stationary 
distribution equal to the distribution of a column update. This distribution 
(see Q2.2[ )) is uniform on the interval [a, b] and decays exponentially outside 
it. Hence the mixing time of this random walk, starting from a position at 
distance £ from the interval [a, b], is 0((b — a) 2 + £). But since r/ ^ B, both 
(6 — a) and £ are bounded by 2D log 2 n, so the mixing time is 0((D log 2 n) 2 ). 
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Thus D 2 log 6 re single-site updates at position i suffice to simulate a single 
column update with very small error e~ n ^ og n \ which is the second term 
in the bound. The factor M comes from a union bound over steps of the 
column dynamics. 

In order to use Claim 4.4 we need to bound v® E (B) (and, symmetrically 
uf°(B)), the probability of the dynamics creating a large gradient. This 
is in general a non-trivial task because it requires detailed non-equilibrium 
information about the contours. However, it is here that our choice of the 
initial distribution uq = fi(- \ A) becomes crucial. Since fi remains invariant 
under any number of steps of M ° E (and of M EO }, we can write, for any s, 

v° E (B) < m(S)/ M (A), (4.10) 

with an identical bound for uf°(B). But the right-hand side here is easy to 
evaluate as it is the ratio of the probabilities of two events in equilibrium] 
In particular, the following straightforward bound is proved in part (c) of 
Lemma C.l in the appendix: 

for some constants a, c > 0. Hence, thanks to the definition of D, 

fi(B)/u.(A) < e -^ lo s 2 ™). (4.11) 



We can now put everything together. For each i G E, the quantity in Q4.9| ) 
is bounded by 

ll|(^OE(i) + ^EO(i)) - + \WIT ~ MIIj ( 4 - 12 ) 

where uf ar denotes the distribution obtained from vq after s steps of the par- 
allel column dynamics. By Claim 4.4 and inequalities Q4.10P and ( |4.11| ), the 
first term in Q4.12D is bounded by e _n ( log n \ which is certa inly o(l/n b ) for 



3.4 



and Remark 



3.5 



any fixed b, while the second term is o(l/n b ) by Theorem 
and the fact that M » n 2 log n (the mixing time of ) . Hence the varia 
tion distance of the dynamics is o(l/n b ), as claimed in the lemma. □ 

4.2. From maximal height to equilibrium. In this subsection we show 
that, after at most 0(n 3 - 5 ) steps, the single-site dynamics starting from 
the maximal configuration (in which the contour has height n everywhere) 
reaches equilibrium. For convenience we will work throughout this subsec- 
tion with a slightly modified model in which the set of allowed heights is N 
rather than [0, re]. The equilibrium distribution fj, for this model is defined 
exactly as in Q1.1D , where the partition function Zp is appropriately defined. 
(Note that Zp remains bounded for any j3 > 0.) We will show that the vari- 
ation distance between the contour at height n and the equilibrium contour 
in this model becomes very small in 0(n 3,5 ) steps. This immediately im- 
plies the same result for our original model with height set [0, re] because 
of monotonicity and the fact that the variation distance between the two 
equilibrium distributions is exponentially small in re (see Remark C.2| in the 
appendix). We will use f2^° to denote the set of configurations with height 
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set N. We note also that our basic building block, Lemma 4^2 is easily seen 
to hold in this setting also. 

The main ingredient in this subsection is the following lemma, which says 
roughly that an initial contour at height h < y/n drops to height approxi- 
mately h/2 after 0(n 3 ) steps. 



Lemma 4.5. Let C(/3)logn < h < -Jn, where C(/3) is a specific constant 
depending only on (3. Let vt he the distribution at time t of the single-site 
dynamics started from /i conditioned on the event Ah := {rj(i) >hVi€ [1, n]}. 
Then there exists a time t n = 0(n 3 ) such that, for any increasing function 
f: m. R with ||/||oo<l, 

v tn {f)<»{f\A h/2 )+o{l/n), (4.13) 

where the term o(l/n) is independent of f. 

Proof. In the proof, we will make use of the single-site dynamics on the 
enlarged interval [— £ + l,n + £], with boundary conditions at positions —£ 
and n + £+l. The parameter £ < n will be chosen later. We may construct a 
monotone coupling of this dynamics with our original one by choosing the 
position i to be updated from the enlarged interval [—£ + l,n + £], and doing 
nothing in the original dynamics if i ^ [l,n]. Plainly this slows down the 
original dynamics by at most a factor of 3 and so does not affect our results. 

Now consider the enlarged dynamics started in its equilibrium distribu- 
tion p& conditioned on t he ev ent A^. Denote its distribution at time t by 



vf 1 . By part (b) of Lemma C.l in the appendix, we have 



cn a 



for constants a, c > 0. Moreover the event Ah is clearly increasing and there- 
fore Lemma [4~2] applied to the enlarged dynamics implies that for a suitable 
time t = 0(n 3 ^) the variation distance between vf 1 and ^ is o(l/ 



If we now set the free parameter £ equal to Le§^Jj where iS < 1 will be 

fixed later, we have that t = 0(n 3 ). (Note also that £ < n, as stipulated 
earlier; this is why we require the upper bound on h.) Thus we may take 
t n := t = 0(n 3 ) and get that, for any / as in the statement of the Lemma, 

VtSf) < 4 £ J(f) < ^ ] U) + o(l/n). (4.14) 

We now bound //W(/). Let Eh = {rj G O^ 3 : max(^(l), rj(n)) < h}. Then 

^(/) < ^(f\E h/2 )+^(El /2 ) 

< Kf\ A h/2 )+^(E c h/2 ). (4.15) 

Now by part (a) of Lemma C.l in the appendix, provided h/2£ < (3/2 we 
have 



16 



FABIO MARTINELLI AND ALISTAIR SINCLAIR 



for constants a, c > 0. By choosing the constant 5 in our definition of t 
small enough, we can make this latter quantity o(l/n), which via Q4.15D 
and Q4.14D yields the desired bound ( |4.13[ ) . Finally the condition h/2£ < 
j3/2 translates to h > js \ogn, i.e., h > C(/3) log re. □ 

A simple iterative application of Lemma |4.5| yields the following: 

Corollary 4.6. In the setting of Lemma 4.5 for any integer j such that 2~ J 7i > 
C(/3) logn we have 

VjtnU) < M(/ I A h/2j) + o(j/n). 

Proof. Write v[ h ' for the distribution of the single-site dynamics at time t, 
started in \ Ah). Let /tf-i) be the function on obtained by applying 
the transition matrix of the single-site dynamics (j — l)t n times to the orig inal 



function /. Clearly fv ^ is still increasing with 1 ^ 1 1 < 1, so Lemma 4.5 
yields 

= v t\f {j ~ 1] ) < I A h/2 ) + o(l/n) 

= ^ 2 lk(/) + °( 1 /")- 
Iterating over j completes the proof. □ 

We are now in a position to prove our first main result, which says that 
the mixing time of the single-site dynamics starting at height ^fn is 0(n 3 ). 

Theorem 4.7. Let A = {r] £ Q£° : r](i) > ^/n Mi e [l,n]} and let u t be 
the distribution at time t of the single-site dynamics started in the distribution 
| A). Then for some time t n = 0(n 3 ) we have 

Ihn - Mil = o(l/ y/n). 
Proof. The event A is increasing, so the relative density between the initial 
distribution and the equilibrium one given by g(r/) := = is 

also increasing. As shown in Il20ll the same holds for gt{jf) = v t (rf) j /i{r\) ■ In 
particular the e vent Ut = {rj : vtiji) > A i ( r /)} is increasing. 

By Corollary 4.6 with j = O(logn) there exists a time s = 0(n 3 ) such 
that 

< M(/ I ^[C(/3)io g nl) + o((logn)/n) 
for any increasing function / with ||/||oo < 1, where A^ is defined as in 
Lemma 4.5 Thus, for any t > 0, we can bound the probability v s+t (U s+ t) 
by 

v s+t (U s+t ) < v t {U s+ t) + o(l/y/n), 
where u t is the distribution of the Glauber chain at time t started from the 
equilibrium distribution \x conditioned on the event ^4|"c(^)iogn]- h follows 
from the proof of part (b) of Lemma C.l|that log(-7T — A = Oflog n). 



Therefore, by Lemma 4.2 we have that, for some t = 0(n 



h{U s+ t) = n(U s+t ) + o(l/n). 
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In conclusion, setting t n := s + t = 0(n 3 ) we get 

lh„ - HI = n n (U tn ) - n(U tn ) = o(lVn), 
which completes the proof. □ 

A simple iterative application of the above theorem shows that, starting at 
height n, the single-site dynamics reaches equilibrium with a further 0(y/n) 
factor overhead, i.e., in total time 0(n 3 5 ). 

Theorem 4.8. Let B = {77 G : rj(i) > ji Vi £ and let u t be the 

distribution at time t of the single-site dynamics started from u{- \ B). Then 
for some time t n = 0(n 3 5 ) we have 

Ihn -mII = 0(1). 

Remark 4.9. By monotonicity, the theorem immediately implies the same con- 
clusion for the single-site dynamics started in the (maximal) configuration 
77(7) = nfor all i E [1, n]. 

Proof. Let us define a new height set := [n — y/n, 00], let Q$ := {77 E 
: r/(i) G i/^ 1 ** Vi G [1, n]} and let /iW be the equilibrium distribution on 
VLn^ given by //(• | fin^). Let z^ 1 -* be the distribution at time t of the single- 
site dynamics on Q$ started from \ B) and with boundary conditions 
r/(0) = 77(71 + 1) = n — y/n. 

By monotonicity and Theorem 4.7 applied to up we have that, after time 



t = 0(n 3 ), for any increasing function / with ||/||oo < 1 3 

vtU) <^\f) < ^Hf)+o(l/Vn). 

If we now define H^> := [n — 2^/77,00] and , /j( 2 ) analogously, we get 
that after a further t steps the distribution i>n of the original chain satisfies 

Mf) < ^ 2) U) + o(l/Vn) + o(l/Vn). 
Iterating y/n times shows that at time t n := y/nt = 0(n 3 5 ) we have 

u tn (f)<Kf) + o(l). 
The proof is complete once we apply the above inequality to the indicator 
of the increasing set U = {7/ G : vtniv) — M 7 ?)}- ^ 

4.3. From zero height to equilibrium. In this subsection we prove the 
complementary fact that the single-site dynamics starting from the minimal 
configuration (in which all heights are zero) reaches equilibrium in 0(n 3 ) 
steps. In our argument we will make use of auxiliary versions of the dynam- 
ics in which certain heights are fixed to be zero. Specifically, for any inte- 
ger m, let M^m denote the single-site dynamics defined as before, except 
that 77(7) is constrained always to be zero for i G {jm : j = 1, 2, . . . , \n/m\ }. 
Let // m ) denote its stationary distribution. Clearly ^' m "> is equivalent to 
/i(- I A m ), where A m is the event that 77(7) = at the above positions i; 
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moreover, /x( m ) is a product of [n/m\ +1 unconditioned SOS Gibbs distri- 
butions each on an interval of length (at most) m — 1. 

The idea in the proof is to control the evolution of the contour by cou- 
pling it with the sequence of dynamics M s ^ m for m = 2, 4, 8, . . so that the 
number of positions with height fixed to zero is successively halved. At each 
stage in the sequence, we will allow A4^ m to reach its equilibrium distri- 
bution // m ). Initially, in the minimum configuration, all heights are zero; 
ultimately we will reach equilibrium with no heights fixed to zero, which is 
our desired SOS equilibrium distribution /i. 

The main ingredient in our proof is the following lemma, which says that 
if we start in the equilibrium distribution (with every mth height fixed 
to zero), then after 0(n 3 ) steps of M^ 2m we wm reach the equilibrium 
distribution ^( 2m ) (with every (2m)th height fixed to zero). 

Lemma 4.10. Let v t denote the distribution at time t o/7W^ s 2m started in the 
distribution //( m ) = ^ 2m \- \ A m ). Then for some time s n = 0(n 3 ) we have 

IK-M (2m) ll=o(l/n). 

Proof. Note that we can view -M^ s 2m as a collection of r = [n/(2m)\ inde- 
pendent standard dynamics on intervals of length 2m — 1, with zero bound- 
ary conditions at positions 2mj,2m(j + 1). Let us focus on the dynamics 
restricted to one such interval Ij. Let j~L denote the stationary distribution 
within Ij, and A the event A m restricted to Ij (i.e., A is just the event 
r/(2mj + m) = 0). Clearly A is a decreasing event, and standard random 
walk arguments [26] imply that fi{A) > -m~ c for a constant c > 0. Hence a 



slight modification of Lemma [4^2 applied to the dynamics within Ij implies 
that, after t = 0(m 3 log 10 n) steps of this dynamics, the variation distance 
from fx is o(l/n 2 ). (The presence of logn rather than logm here is to ensure 
a variation distance that depends on n.) 

Returning now to the full dynamics -M^ s m , suppose we execute suffi- 
ciently many steps T that at least the above number t updates are per- 
formed within each interval Ij for j = 1,2, ... ,r. Since /Z 2 " 1 ) is a prod- 
uct distribution, this will ensure that the variation distance || vj- — /i( 2m )|| is 
o(r/n 2 ) = o(l/n). But by a Chernoff bound it suffices to take T = 2tr = 
0{nm 2 ) = 0(n 3 ) in order to ensure the above condition with probability 
1 — exp(— r2(log 10 nj) = 1 — o(l/n). Taking s n = T completes the proof. □ 

An iterative application of the above lemma now proves the main result 
of this subsection, which is the analog of Theorem 4.8 starting from the 
minimal configuration. 

Theorem 4.11. Let v t be the distribution at time t of the single-site dynamics 
started from the minimal configuration. Then for some time t n = 0(n 3 ) we 
have 

Ihn - HI =o(i). 
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Proof. Let vt be the distribution at time t of the following dynamics, starting 
from the minimal configuration. For the first s n steps (where s n = 0(n 3 ) is 



as defined in Lemma 4.10 ), run the dynamics M^ 2 ', f° r the next s n steps run 
the dynamics M^^, and so on (i.e., run s n steps of each dynamics M. s * 2j 
for j = 1,2,...). Note that the d istribution of the initial configuration is 
exactly /Z 1 ). Thus by Lemma 4.10 we have \\i> Sn — = o(l/n). Similarly, 



fori = 1,2,... 

t n := [logn]s n 



applying Lemma 4.10 iteratively implies that 

Since for m > n, we may set j 

0(n 3 ) to conclude 

IP*™ - Mil = o((logra)/n) = o(l). 

Finally, note that by monotonicity we have u t -< v t -< fi for all t. Hence Q4.16| ) 
implies \\u tn — fi\\ = o(l), as required. □ 



[logn] and 



(4.16) 



4.4. Proof of Theore m |4.1[ The p roof of Theorem 4.1| is now easily de 



duced from Theorems 4.8 and 4.11 Let z/™ ax and f t mm denote the distribu- 
tions of the single-site dynamics at time t starting in the maximal and mini- 
mal configurations respectively. By Proposition 2.1| to prove the theorem it 
suffices to show that for some time t = 0(n 3,5 ) we have ||^ max — i/' nm || = 
o(l). But Theo rem 4.8 shows that, for such a t, \\vf 

o(l/n). Hence Hff 1 ^ 



Theorem 4.11 that 



completes the proof. 



•Ml 



- Mil 

,min I 



o(l/n), and 
o(l), which 



Remark 4.12. It should be clear that our result generalizes to the SOS dy- 
namics on a region [1, n] x [0, h], in which the allowed height set is [0, h]. The 
mixing time for the single-site dynamics is then 0(max{n 3 , n 2 5 /i}). The only 
difference from the h = n case is in the analysis of the maximal contour: by an 



argument analogous to that in Theorem 4.7. we achieve a height reduction of 
Fn in 0(n 3 ) steps, yielding the above bound. 

Appendix A. Proofs of Lemma [3721 and Corollary |3.3| 



Proof of Lemma 3.2 Abusing notation, we write -q for 7] t +i (i) and abbreviate 
E[ • | a, b] and Pr[ • | a, b] to E[-] and Pr[-1. We also introduce the notation 



-2/3m- 



, where S a 



-26 



/(I' 



S m for the sum Y^ =1 
and So = 0. Note that the normalizing factor in \2.2\ can then be written as 



Z 



a-l 

E 

3=0 

-B(b-a) 



e -B(b-a)-2/3(a-j) + ^ e ~P{b-a) 



+ 



n 

E ' 

j=b+l 



-8{b-a)-2p{j-b) 



3=a 

e ">(S a + (b-a + l) + S n - b ). 
Our goal is to evaluate E[rjt+i(i) \ a, b] = E[ry], which we may write as 
E[rj] = Pr[r] < a + b] E[rj \ rj < a + b] + Pr[r? > a + b] E[rj \ rj > a + b]. 
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Since the conditional distribution Pr[- | rj < a + b] is symmetric w.r.t. to the 
point ^ then E[rj \ r] < a + b] = (a + b)/2, while 

E[r)\ V >a + b] = ( Y je- 2 ^ +f}{a+b) \ I [ Y e~ 2 ^^ 

\j=a+b+l } 

(n-(a+b) 
Y (a + b + k)e-^ k 
k=l 

= a + b + T n _( a+fc ), 

where T m := (J2™ = i je- 2Pi )/S m = 1/(1 - e~ 2 ^) - me- 2 ^ m /(l - e" 2 ^). 
Therefore, we have 

i a + b , , . 
m = — +e(a,b) 

where 

e(a, b) = Pr[7? > a + b]((a + b)/2 + T n _ {a+b) ). (A.17) 

Since plainly e(a, b) > 0, this gives the first part of the lemma. 

To prove the second part, we claim it suffices to show that e(a, b) is a 
decreasing function of a and of b (subject to a < b and a + b < ri). To see 
this, consider any pair (c, d) such that c < min{a, d} < max{a, d} < b. (Note 
that this also implies c + d < n.) If a < d then using monotonicity in b and 
then in a we have e(a, b) < e(a, d) < e(c, d); if on the other hand d < a then 
we have similarly e(a, b) < e(c, b) < e(c, d). 

To show that e is decreasing with b, note first that 

Pr[f] > a + b\ — 



S a + (b-a + l) + S, 



n—b 



Plugging this into ( |A.17[ ) and differentiating w.r.t. b gives 

d d 
-e(a,6) = (i-i;_ (a+6) )Prfo > a+b]+((a+b)/2+T n _ {a+b) )- 



(A.18) 

where Pr[?7 > a + b] is given by the expression 

-2Pa~ S 'n~(a+b)( Sa + ( b ~ + 1) + S n - b ) - (1 - 5^_ 6 )5 n _( 0+6 ) 
[S a + (b - a + 1) + £„_ 6 ] 2 ' 

(Here we are viewing 5 m and T m as continuous functions of the parame- 
ter m.) Note that S' is non-negative, and that 

5 , .^_( a+ fc)5 , 7i-6 > ^-^^-(a+fe) (A.19) 

because 

S' n . (a+b) S n - b = 2(3Sle-^-^(l-e-^-% 
S' n -bS n - {a+b) = 2pSle-^- b \l-e-^-^)). 
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Therefore, 



36 " [S„ + (6-a + l) + S„_(,] 2 S„+(6-a + l) + S, 



n—b 



Plugging this into ( A.18 ), and noting that T' is also non-negative, we get 

» < i Prh s ,;; t _ + ^v ) ( rL ) • (A ' 20) 

We will therefore be done if we can show 

a + b + 2T n _ (a+b) > S a + (6 - a + 1) + S n -b- 

We do this in two steps, as follows: 

a + b + 2T n _ (a+6) > a + b + 1 + S n _ (a+6) > S a + {b - a + 1) + S„_ 6 . (A.21) 

Note that we may assume n — (a + b) > 1 since if a + b = n then e(a, b) = 0. 

Proof of first inequality in QA.21[ ): Setting m = n — (a + b), we need to show 
that 

S m + 1 < 2T m for 1 < m < n. (A.22) 
Recall that S m = 5^(1 - e~ 2/3m ), where 5^ = e" 2/3 /(l - e" 2/3 ), and that 
T m = Y^jLij e ~ 2 ^^/^m- A simple calculation shows that 

1 me- 2/3m 
T — 

m l _ e -2/3 l_ e -2/3m- 

Thus, writing x = e~ 2/3 , the desired inequality in QA.22[ ) becomes 

x mx 2 2mx m 

-l-x m ) + l< forO<x<l. 



1— X 1 — X 1 — X T 

Rearranging yields the equivalent expression 

1 - (2m + l)x rn + (2m + l)x m+1 - x 2m+1 > 0. (A.23) 
Differentiating the left-hand side w.r.t. x gives 

- m(2m + l)^" 1 + (m + l)(2m + l)a; m - (2m + l)x 2m 

= -(2m + l)x m ~ 1 [m- (m + l)x + x m+1 ]. (A.24) 

But the function /(x) := m — (m + l)x + x m+1 is zero at x = 1 and monoton- 
ically decreasing for < x < 1, so is always non-negative on this interval. 
Therefore, the derivative in QA.24| ) is non-positive. Thus the left-hand side 
of QA.23P is a non-increasing function of x, and is zero at x = 1; hence the 
inequality holds, and ( |A.22[ ) is proved. 

Proof of second inequality in ( |A.21[ ): We need to show 

S a + S n -b — 5 n _( a+fc ) < 2a. (A.25) 
The left-hand side here is equal to 

Soo (l - e - 2 ? a + e -mn-b)( e 2p a _ 
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For any fixed a > 0, this expression is maximized by taking n — b as small as 
possible, which means n — 6 = a + 1 (since we are assuming a + b < n — 1). 
Thus the expression is bounded above by 



e 



Plugging this back into QA.25D means we need to prove 25oo(l — e 2/3a ) < 2a, 
or equivalently 



p-2/9 



and also the proof that Jje(a, b) < 0. 

We now carry out a similar computation for ^e(a, 6) to get 



1 - e- 2 / 3 

Writing x = e~ 2/3 and rearranging gives the equivalent inequality 

a- (a + l)x + x a+1 > for0<x<l. 

But the function on the LHS here is precisely the function f(x) in the pre- 
vious part of the proof (with m replaced by a), so we know that it is non- 
negative throughout the desired interval. This completes the proof of QA.25| ), 

db f 

da c 

—e(a,b) = (h-K_ {a+b) )PT[r, > a +b]+((a+b)/2+T n _ {a+b) )—Pr[ V > a+b], 

(A.26) 

where 

d 

— Pr[?7 > a + b] = -2/3 Prfr/ > a + 61 + 
oa 

-2/3a - S 'n-(a+b) (S a + (b - a + I) + S n - b ) - (S' a - l)S n _ {a+b) 

[S a + (6 - a + 1) + S n . b } 2 • 
Using QA.191 ) and the fact that S' is non-negative, we get 

|- Prfr/ > a+b] < Pr[r? > a+6] (-2/3 + - 1 . " Sa ~ f"- fc ) . (A.27) 
da V 6 a + (6 - a + 1) + 6„_ fe y 

We claim that the term in parentheses here is bounded above by — (3. To 
see this, note first that S' m = 2/3(S' 00 — S m ) for all m > 0. Thus the term in 
parentheses can be written 

-4/3^ - 2/3(6 - a + 1) + 1 
S a + 5„_ fe + (6 - a + 1) 
This is bounded by — /3 provided 

4/35oo + 2/3(6 - a + 1) - /3(5 a + 5 n _ b + (6 - a + 1)) > 1. 

Using the facts that Soo > S a , S b , and that 6 — a + 1 > 1, is is sufficient to 
show 

/3(25oo + 1) > 0. 

Plugging in Soo = e~ 2 ^/(l — e _2/3 ) and rearranging, this is equivalent to 

/(/3) := (/3 - l)e 2/3 + /3 + 1 > 0. 
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The function / is at (3 = 0, and its first and second derivatives are e 2l3 (2(3 — 
1) + 1 and 4/3e 2/3 respectively. Since the first derivative is at (3 = 0, and 
the second derivative is non-negative for all (3 > 0, / must indeed be > 
throughout this range. 

Replacing the parenthesis in QA.27D by the upper bound — (3, and plugging 
this into QA.26P , we get 

— e(a,b) < \YT[r 1 >a + b]{l-2T' n _ {a+b) -P{a + b + 2T n _ {a+b) )^ 
< \ Pr[r? > a + b] (l - 2T' m - 2f3T m ) , 
where m = n — (o + b). We will thus be done if we can show 

Substituting for T m and T' m , writing y = 2(3m and rearranging, we see that 
this is equivalent to 

But since /3/(l - e _2/3 ) > 1/2 for all (3 > 0, it is sufficient to show that the 
function g(y) := y — 2 + (2 + y)e~ y is non-negative for all y > 0. But <?(y) 
is zero at y = 0, and its first and second derivatives are 1 — e _?/ — ye~ y and 
ye~ y respectively. Since the first derivative is at y = and the second 
derivative is non-negative for all y > 0, g must indeed be non-negative for 
all y > 0. 

This completes the proof that J^e(a, b) < 0, and hence also the proof of 
the lemma. □ 

Proof of Corollary \3. 3\ Note first that, by symmetry, if a + b > n then we get 
a complementary statement to Lemma 3.2 with e(a, b) replaced by —e(b', a 1 ), 
where a! = n — a, b' = n — b. (This corresponds to exchanging the roles of 
the top and bottom barriers.) We shall use both versions below. 

Consider first the case a + b < n and c + d < n. Note that r/ t < 6 implies 
that c < min{a, d} < max{a, d} < b. Then we have 

E[&+i(i) | a, b] - E[q t+1 (i) \c,d} = - + e(o, b) - e(c, d), 



and by Lemma 3.2 we can conclude that e(a, b) < e(c, d), as required. 

Now consider the case a + b > n and c + d > n. In this case we can change 
variables to r/(i) = n — rj(i), = n — ^{i), a! = n — b,b' = n — a, c' = n — d, 
d' = n — c. Note that a' < minj^', c'} < max{6', c'} < d'. Then we have 

Efo+i(t) | a, b] - E[ Vt+1 (i) \c,d} = E[ V ' t+1 (i) | c', d'\ - E[^ m (i) | a', 6'] 

a + b c + d / / ,/n 

= — — + e(c',d')-e(a',b'), 



and by Lemma 3.2 we can conclude that e{d d') < e(a', b'), as required. 
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Finally, if a + b > n but c + d < n then we apply the above change of 
variables only to rj, a and b to get 

E[6+i(«) I a M -Hm+iii) \c,d] = n-E[& +1 (i) \ b',a] -E[rft+i(i) \ c,d] 

a+b c+d 

= — j g e(« ,o) -£(c,d), 

and use the fact that both e(a', 6') and e(c, d) are non-negative. □ 

Appendix B. Proof of Claim l4~4l 

Let jl be any probability distribution on J7 ra or ^1°°^, and let w = {ii, . . . , i^} 
be a sequence of even positions in which every even position i G [l,n] ap- 
pears at least D 2 log 6 n times. Let ji ewen be the new distribution obtained by 
applying to jl one parallel update E of the even positions, and /2 W the dis- 
tribution obtained by applying to jl the single-site updates at the sequence 
of positions w. 

We claim that 

Uneven " £w|| < fifi) + e""^^, (A.28) 

where the f2(log 2 n) term is independent of jl. An entirely analogous state- 
ment will clearly also hold for a parallel update O of the odd positions. This 
will complete the proof by a simple induction over the number of epochs M, 
since it ensures that every step of -M^ E and (the constituent chains 

of Mn^) is simulated correctly with error at most jl{B) + e~ n ( log n \ where 
jl is the current distribution of the respective chain and clearly satisfies 
jl(B) < max s K OE (5) + vf°(B)}. 

To prove ( |A.28| ), let r/ £ B be a configuration without large gradients, 
which accounts for the term jl{B) on the right-hand side. We need to show 
that, for each even position i e [1, n], applying at least D 2 log 6 n single-site 
updates to i is equivalent to applying one column update at i, except for an 
error e~ n( > log n \ Recall that a column update at i replaces the height rj(i) by 
a random height drawn from the distribution Q2.2| ). Note that this distribu- 
tion is uniform on the interval [a, b], where a = min{r/(i — 1), rj(i + 1)} and 
b = max{rj(i — l),rj(i + 1)}, and decays exponentially outside this interval 
(at a rate that depends on jS). On the other hand, under the single-site up- 
dates the height at i performs a nearest neighbor random walk on N that 
is reversible w.r.t. this distribution, and of course converges to it. If we can 
show that the random walk starting at rj(i) is very close to equilibrium after 
D 2 log 6 n steps, we will have proved the Claim. 

Now we claim that the mixing time of the above random walk, starting 
at a position at distance £ from the interval [a.b], is 0((b — a) 2 + £). To see 
this, note that the random walk is symmetric on [a, b] and has a uniform 
drift towards this interval from everywhere outside it. The term 0(£) in the 
mixing time reflects the time to reach [a, b], which is linear because of the 
drift, and the term 0((b — a) 2 ) is the mixing time starting inside the interval, 
which is essentially the same as that of symmetric random walk on [a,b]. 
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This can be verified formally by showing that the walk started at distance t 
from [a, b] can be coupled with a stationary walk so that the two walks 
coalesce with constant probability in the interval [a, b] after 0((b — a) 2 + £) 
steps. But the fact that rj £ B implies that (b — a) < 2D log 2 n and £ < 
D log 2 n, so the above coupling time is 0((D log 2 n) 2 ). Hence D 2 log 6 n steps 
suffice to simulate a column update with error e" 

-Q(io g n) This com pietes 



the proof of QA.28| ) and hence of the Claim. □ 



Appendix C. Equilibrium bounds 

The lemma below contains bounds on the probabilities of various events 
under the equilibrium distribution \i that are used extensively in Section |4| 
We state and prove the lemma for the case where \i is the equilibrium dis- 



tribution of the SOS model with height set N (which we use in Section pK2j ). 
However, essentially the same bounds hold for the SOS model with height 
set [0, n] once we observe that the variation distance between the two equi- 



librium distributions is exponentially small in n (see Remark C.2 below for 
details). 

Lemma C.l. Let \i be the equilibrium distribution for the SOS model on [1, n] 
with height set N. There exist positive constants a, c such that: 

(a) for anyh>0 and £ e [1, n] with h/l < f3/2, 

H(rj(£) >h)< n a e~ h2 / ce ; 

(b) for any h > 0, < i < [n/2\, and A = {r/ e Q n : > h Vi G 
[£+l,n-£]}, 

^ y ' ~ cn a 

(c) if B = {rj : \rj(i + 1) — rj(i)\ > dfor some i G [0,n]} then 

H(B) < cn a e- d/c ; 

(d) ifC = {Vi : r](i) < n} then 

H(C C ) < cn a+1 e~ n/c . 

Remark C.2. It should be clear that the equilibrium distribution with height 
set [0, n] is nothing other than the distribution /z conditioned on the event C 
above. Thus the variation distance between these two distributions is no larger 
than 2[i(C c ), which by part (d) of the lemma is exponentially small in n. In 
particular the bounds (a) and (c) above hold (possibly with different constants 
a', d) for height set [0,n]. The lower bound (b) also holds for height set [0, n] 
under the additional restriction that h? jt ^ n (so that the variation distance 
between the two distributions is negligible compared to the r.h.s. in (b)). 

Proof. 

(a) Let Zq = 0, and let {Zj}j>± be i.i.d. geometric random variables with 
random signs, so that Pr[Zj = z] = Pr[Zj = —z] oc e~@ z . Define X(i) := 
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S}=o to ^ e symmetric random walk on Z started at the origin whose 
increments are the Zj. 

Then the SOS equilibrium distribution fi on [l,n] with height set N can 
be written as 

fi(rj) = Pt[{X = n | X(n + 1) = ; X(i) > V* € [l,n] ]. 
Standard random walk bounds show that 

Pr[X(n + 1) = 0, X(i) > OVi G [l,n\] > l/n a 

for some constant a. Therefore, by Markov's inequality applied to the ran- 
dom variable exp(A J2i Zi), for any A G [0, /3) we have 

H(r](£)>h) < n a Pv[X(£)>h] 

< n a e- Xh E(e XZl ) e 

< n a e- Xh (l + ^A 2 E(e A|Zl| Z 2 )) £ , (A.29) 

where we used the inequality e Xx < 1 + Ax + I,\ 2 x 2 e max ( A:r ' ) together with 
the fact that E(Zi) = 0. 

Now choose A = Sh/£ with < <5 < 1. Since A < (3/2 the term E(e A l Zl IZ 2 ) 
is bounded by a constant c = c(/3) so that 

By choosing 5 small enough we complete the proof of part (a). 

(b) With the same notation as in part (a), and using the FKG inequality we 
may write 

K A ) > Yl Pr t x W >0Vi^[£+l,n-£}; X(£+l) = h\,X(n - I) = h 2 \ 
hlM - h X(0) = X(n+1) = 0] 

x Pv[X(i) >h\/i e [£ + l,n-£] \X(£ + 1) = h h X(n-£) = h 2 ] 

>Pr[X(i) > OVi G [0,£]; X(£ + l)>h \ X(0) = 0] 2 

x Pv[X(i) > OVi G [0,n + 1 - 2£] | X(0) = X(n - £) = 0] 

> ^Pv[X(£)>h\X(0) = 0} 2 , 

where in the last step we used standard random walk estimates Il26l to get 

Pt[X(i) > OVi G [0,n] | X(0) = 0] > 1/n 7 

for some constant 7. The proof is easily finished by standard techniques for 
proving large deviation lower bounds for sums of i.i.d. random variables. If 
h/£ > 1 we can write 

Pr[X(£) > h I X(0) = 0] > Pr[X(£ - 1) > Q;Z e > h \ X(0) = 0] 
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for a suitable constant c. If instead hjl < 1 we introduce the tilted distribu- 
tion Pr A (Zi = z) oc e -P\ z \+ Xz with A such that E A (Zi) = /i/t It is easy to 
see that A = 0(h/£). Using Jensen's inequality we can write 



Pr[X(£) > h | X(0) = 0] > E(e AZl /E A (e- A ^=i Zl xE^ G [/i,2/i] 

*Pr A Zi e [h, 2h] 



> e -2\h-D„x | 



> e 



i=l 



for another suitable constant c. 

(c) It is enough to write 

fJ,(B) < .ma.— Pr(|Zt| , < 

PV 7 ~ i Pr[X(i) > OVi G [l,n] ; X(n) = | X(0) = 0} ~ 

again by standard random walk bounds. 

(d) After a union bound over the index i 6 [1, n] it is enough to repeat the 
arguments used in the proof of part (a) up to QA.291 ), and then choose the 
free parameter A small enough (independent of ri). □ 
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